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We report a measurement of the branching fraction ratios R{D^*^) of B —>■ D^*\~Ut relative to 
'Vi (where £ = e or p,) using the full Belle data sample of 772 x pairs collected at 

the T(4S') resonance with the Belle detector at the KEKB asymmetric-energy e^e“ collider. The 
measured values are R{D) = 0.375 ± 0.064(stat.) ± 0.026(syst.) and R{D*) = 0.293 ± 0.038(stat.) ± 

0.015(syst.). The analysis uses hadronic reconstruction of the tag-side B meson and purely leptonic r 
decays. The results are consistent with earlier measurements and do not show a significant deviation 
from the standard model prediction. 

PACS numbers: 13.20.He, 14.40.Nd, 14.80.Da 

INTRODUCTION theoretically [3], and are used, for example, to extract the 

standard model (SM) parameter \Vcb\ [!]■ The replace- 


Semileptonic B —>• D^*'>£ vg decays [T], where £ = e 
or p, have been studied in detail, experimentally [2] and 
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merit of the light lepton by the higher-mass r leads to an 
increased sensitivity to new physics (NP) effects. In par¬ 
ticular, models with charged Higgs bosons [Si, whose 
couplings are proportional to mass and thus more pro¬ 
nounced for T leptons, predict measurable deviations of 
the branching fraction and kinematic distributions from 
SM expectations. The measurement of i? —>■ D^*'>T~Dr 
is challenging because the r must be reconstructed from 
its decay products that include one or more neutrinos. 

The first observation of an exclusive semitauonic B 
decay was reported by the Belle Collaboration in 2007 
in the channel —)■ [ 3 . Subsequent mea¬ 

surements by BaBar and Belle [SHlQ] reported branch¬ 
ing fractions above—yet consistent with—the SM pre¬ 
dictions. In 2012, a significant excess over the SM ex¬ 
pectation was reported by BaBar m that suggested the 
presence of NP; this called for an independent confirma¬ 
tion. Interestingly, the two-Higgs doublet model (2HDM) 
of type II, which might explain a deviation from the SM 
expectation in a (semi)tauonic B decay [3, is incompat¬ 
ible with this result. A recent LHCb measurement of 
B^ —>■ D*^T~Vr [H] also shows a 2.1cr deviation from 
the SM prediction. 

Measurements and predictions are usually quoted as 
branching fraction ratios 


and 


R{D) = 


B{B Dr-Pr) 
B{B Di-ve) 


R{D*) 


B{B D*T-Dr) 
B{B D*e-vi) 


( 1 ) 

( 2 ) 


to reduce experimental systematic uncertainties and the¬ 
ory uncertainties from form factors, where B{B —>■ 
D^*k-Vi) = [BiB D^*k-Pe)+B(B y.-p^)]/2. 

In Ref. in the calculations in Ref. m are used with 
updated form factor measurements to obtain the stan¬ 
dard model predictions R{D)sm = 0.297 ± 0.017 and 
R{D*)sm = 0.252 ± 0.003. More recent predictions of 
RiD)sM are 0.299 ± 0.011 [H] and 0.300 ± 0.008 
In this paper, we report new measurements of R{D) 
and R{D*) with the full Belle T(4S') — BB data set of 
711 fb“^. The r lepton is reconstructed in the leptonic 
decays t~ — >■ e~PeVT and t~ so that the signal 

and normalization modes have the same detectable final 
state particles. This reduces the systematic uncertainty 
in R but requires a method to distinguish the modes ex¬ 
perimentally. For this purpose, we exploit the kinematics 
of e+e“ —>■ T(4S') —>■ BB by reconstructing the accom¬ 
panying B meson, i?tag, in a hadronic decay mode and 
extracting the invariant mass squared. 


-^miss — {Pe+e- Ptag Pi) j (3) 


of all undetected signal-R meson daughters, where Pe+e- j 
Ptag, PD<-*h and Pi are the four-momenta of the collid¬ 


ing beam particles, the Btag candidate, and the recon¬ 
structed signal-R daughters, respectively. 

The distribution peaks at (above) zero for the 

normalization (signal) mode with one neutrino (three 
neutrinos) in the final state. The separation power is 
weaker for backgrounds where multiple final-state par¬ 
ticles are not reconstructed. We improve the rejection 
of such backgrounds by training a neural network to 
distinguish them from the signal in the high-M^j^^ re¬ 
gion. Since the low- and high-M^j^g regions are domi¬ 
nated by different backgrounds, the data sample is split 
at = 0.85GeV^/c'* and the subsamples are fit si¬ 

multaneously. In the low-M^jgg region, which is domi¬ 
nated by the normalization mode, we fit the M^jgg distri¬ 
bution; in the high M^jgg region, where the background 
with multiple missing particles contributes, we fit the 
neural-network output distribution. The analysis pro¬ 
cedure is developed and optimized with simulated data 
before applying it to the experimental data. 


BELLE EXPERIMENT 

This measurement is based on a data sample that con¬ 
tains 772 X 10^BB pairs, collected with the Belle detector 
at the KEKB asymmetric-energy e+e“ (3.5 on 8 GeV) 
collider [16] operating at the T(4S') resonance. The Belle 
detector is a large-solid-angle magnetic spectrometer that 
consists of a silicon vertex detector (SVD), a 50-layer cen¬ 
tral drift chamber (GDC), an array of aerogel threshold 
Cherenkov counters (ACC), a barrel-like arrangement of 
time-of-flight scintillation counters (TOF), and an elec¬ 
tromagnetic calorimeter comprised of CsI(Tl) crystals 
(ECL) located inside a superconducting solenoid coil that 
provides a 1.5 T magnetic field. An iron flux-return lo¬ 
cated outside of the coil is instrumented to detect 
mesons and to identify muons (KLM). The detector is 
described in detail in Ref. m- Two inner-detector con¬ 
figurations were used. A 2.0-cm beampipe and a three- 
layer silicon vertex detector was used for the first sample 
of 152 X lO^BB pairs, while a 1.5-cm beampipe, a four- 
layer silicon detector, and a small-cell inner drift chamber 
were used for the remaining 620 x 10^BB pairs [T8| . 


RECONSTRUCTION 

We reconstruct i?tag candidates using the hierarchi¬ 
cal hadronic full reconstruction algorithm |19] . which in¬ 
cludes 1149 B final states. The efficiency of the Rtag re¬ 
construction is 0.3% for B^ and 0.2% for B^ mesons [H]. 
Requirements on three observables are applied to en¬ 
hance the sample’s purity: the beam energy-constrained 
mass Mbc = \/A'beam ~ (Ptagc)^/c^ must lie between 
5.274 and 5.286GeV/c^, where Ebeam is the colliding- 
beam energy and ptag is the Btag momentum, both mea- 
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sured in the center-of-mass system (CMS); the absolute 
value of the energy difference Ai? = iiltag — ^'beam must 
be smaller than 50 MeV, where Et^g is the Stag CMS 
energy; and the full-reconstruction neural-network qual¬ 
ity estimator for Stag (which incorporates modified Fox- 
Wolfram moments [50] to suppress e+e” —>■ qq continuum 
events) must exceed a channel-dependent threshold that 
preserves « 85% of the B —>• D^*^T~Vr events. 

In each event with a selected Stag candidate, we search 
for the signature with £ = e or /r, among the re¬ 

maining tracks and calorimeter clusters. The four dis¬ 
joint data samples are denoted D°i~, D*^£~, 

and D*^i~. We reconstruct mesons in the decays to 
iF“7r+7r+, Kgir^, iFg7r+7r°, and i^^TT+Tr+Tr”; mesons 
to iF“7r+, i4r“7r+7r+7r“, iF“7r+7r°, Kgir^, and iFg7r+7r“; 
£)*+ mesons to D°7r+ and II+7r°; and D*^ mesons to 
D^tt^ and 

Charged-particle candidates are selected from tracks 
that originate from within 4.0 (2.0) cm along (perpen¬ 
dicular to) the beam direction of the interaction point 
(IP). Selections on the particle-identihcation likelihood 
ratio of the electron (muon) vs. the hadron hypothesis 
for the candidate lepton track retain 95 % (92 %) of signal 
events. We veto a D^*'> candidate if a charged daughter 
is lepton-like, with a signal efficiency of 97 %. Kg candi¬ 
dates are reconstructed from pairs of oppositely charged 
tracks, treated as pions, and must satisfy standard qual¬ 
ity requirements |21] . 

Clusters in the ECL with an energy of at least 50 MeV 
and no matching track are identified as photons. Can¬ 
didate 7r°’s are reconstructed from pairs of photons. For 
end-cap photons used in a 7r° candidate, the energy must 
be greater than 80 MeV. The momentum of 7r° candidates 
not originating from a D* decay must exceed 200 MeV/c 
in the signal-i? rest frame. The absolute value of the 
difference between the invariant mass of the tt^ can¬ 
didate and the nominal tt^ mass, normalized to its un¬ 
certainty, must be below 3.0. 

We select 7?+/° meson candidates with a CMS momen¬ 
tum below 3.0GeV/c. For both D and D* candidates, 
the candidate D mass and D* — D mass difference must 
be within 1.5 standard deviations of the nominal D mass 
and D* — D mass difference, respectively. The resolution 
is asymmetric and is taken from simulated data. 

The missing mass squared, must lie between 

—0.2 and 8.0GeV^/c^. The momentum transfer q^ = 
(pb — PomY the signal side is required to be 
greater than 4.0GeV^/c^, which suppresses the other¬ 
wise overwhelming contribution from semileptonic B me¬ 
son decays to light leptons. Events with a remain¬ 
ing 7r° candidate are rejected if the energy of either 
daughter photon exceeds 50/100/150 MeV in the bar¬ 
rel/forward/backward region. The overall charge of the 
event must be zero, with no additional charged tracks 
allowed. 

If there are several Stag candidates, the one with the 


most signal like neural-network quality estimator is se¬ 
lected. Then, on average, we have 1.23 signal or nor¬ 
malization candidates per event, with most ambiguities 
arising from D* meson decays to a Z? meson and a neutral 
pion or photon in B —>■ decays. In a multicandi¬ 

date event, we select one at random. 

SIMULATION 

We use samples of simulated (MC) events to study 
backgrounds, to optimize the selection criteria, and to 
determine the probability density function (PDF) shapes 
of the fit components. The decay chains in all simulated 
data are generated with the EvtCen package; the 
GEANT3 [33] framework is used to simulate the detector 
response. A luminosity-weighted run-dependent sample 
of 10^ events for each of the four signatures is generated 
for the signal mode B —>■ using the decay model 

described in Ref. |Mj . To investigate possible new physics 
effects, we produce a sample of simulated B —>■ D^*'>T~i>r 
signal events for the scenario of a two-Higgs-doublet 
model of type II with tan/3/m//+ = 0.5c^/GeV [24] . 
A sample that corresponds to 5 times the amount of 
recorded data and contains BB events with B mesons 
decaying generically via b ^ c transitions as well as qq 
events with q € {u, d, s, c} is used for the background. 

Several corrections are applied to the background MC 
sample to improve its agreement with measured data. 
We first reweight the MC events to account for the im¬ 
perfect estimate of the proportions of correctly recon¬ 
structed Stag candidates and to better estimate the yields 
of background processes with good tags. (The reweight¬ 
ing cancels to first order in the efficiency ratio used to 
extract R{D^*^).) The weights are given by the ratios 
of yields in simulation and data, determined from fits 
to the distributions of Mbc and for events with 

a Stag and a semileptonic decay on the signal side [25j . 
The correction factors are in the range 0.35 to 1.1, with 
an overall factor of approximately 0.75. To extract 
correction factors for the number of incorrectly recon¬ 
structed Stag candidates, we compare yields of simulated 
and reconstructed data in a sideband of Mbc, requiring 
5.23GeV/c^ < Mbc < 5.25GeV/c^. This is done sepa¬ 
rately for the four signal modes, and we exclude events 
with fake D^*'^ mesons or fake leptons on the signal side 
as these are corrected by other measures. The ratios of 
the yields, whose values are between 0.99 and 1.14, are 
then applied as weights. 

Second, we apply a correction for the signal-lepton can¬ 
didates to account for differing misidentification rates in 
simulated and recorded data. Gorrection factors for the 
lepton candidate are provided in eight (eleven) bins in po¬ 
lar angle (momentum). (Lepton-identification efficiencies 
are compatible, within uncertainties, between simulated 
and recorded data.) 
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Third, we reweight the events to account for D^*'> yield 
differences in MC and data. While the yield of candi¬ 
dates with a fake D meson will be estimated from side¬ 
bands and therefore does not need to be corrected in 
simulated data, differences in correctly reconstructed D 
yields can affect the determination of R. We determine 
the yield ratios of simulated and reconstructed data by 
fitting the invariant mass (mass difference) distributions 
of the D {D*) mesons in a wider window than used for 
the nominal selection and apply the ratios as weights. 
This is done individually for each meson reconstruc¬ 
tion channel and yields correction factors between 0.75 
and 1.09. Background MC events with Dj i~Vi de¬ 
cays are reweighted to adopt the latest branching fraction 
measurements [5]. 

Fourth, semileptonic decays of B mesons to higher ex¬ 
citations of D mesons, hereinafter labelled D**, com¬ 
prise one of the most challenging backgrounds. Our 
background MC sample contains semileptonic—including 
semitauonic —B decays to D^, Dq, Di, D[, and the ra¬ 
dial excitations D{2S) and D*{2S), each in the charged 
and neutral variety. The decays are generated initially 
according to the ISGW model [2S] and reweighted to re¬ 
produce the distributions in and (the lepton mo¬ 
mentum in the signal-i? frame) of the LLSW model [21] . 
Parameter uncertainties in this model are treated as sys¬ 
tematic uncertainties. We consider D** decays to a 
and one or two pions, a p, or an ry meson, with branch¬ 
ing ratio assumptions based on quantum-number, phase- 
space, and isospin arguments. Similar weights are ap¬ 
plied to B ^ events in the background MC 

according to the most recent measurements of the form 
factors p2 = 1.207 ± 0.015 ± 0.021, Ri = 1.403 ± 0.033, 
and i ?2 = 0.854 ± 0.020 for B —>• and = 

1.186 ± 0.036 ±0.041 for B glEH]. 


SAMPLE COMPOSITION 

We identify the following components in the data sam¬ 
ples: 


tau signal: This component, arising from B 

dWt- Vr decays, has a correctly reconstructed 
daughter and a correctly identified r-decay 
daughter lepton. With three final-state neutrinos, 
its broad distribution is most prominent in 

regions above l.OGeV^/c^. The yield F)-^fg„ai of 
the tau signal in each Di~ sample is determined 
by the branching-fraction ratio R{D), which is a 
free parameter in the fit, the corresponding lepton 
normalization yield ^'od the efficiency ra¬ 

tio for the lepton normalization and tau signal 
components: 

= R{D)Y/i-'JS ) ■ ( 4 ) 


The factor of 2 accounts for the inclusion of both 
electrons and muons in the lepton normalization 
component. The efficiency ratios, which include 
the T~ — >■ branching fractions |2], are deter¬ 

mined from simulation to be = 1.69±0.09 and 
= 1.91 ± 0.06, where the uncertainties are sta¬ 
tistical. In a similar way, the tau signal yield in the 
D*i~ samples is given by the floating fit parameter 
R{D*) and the corresponding lepton normalization 
yield. However, to encompass larger yields and thus 
obtain smaller statistical uncertainties, the cross¬ 
feeds are added to the tau signal and lepton nor¬ 
malization with the concomitant use of an effective 
efficiency ratio , defined by 


1 


/eff 


1 - XCF 


a:cF 


fD* + -0 ’ 

/CF 


( 5 ) 


where xqy is the fraction of lepton cross-feed events 
relative to the sum of lepton normalization and lep¬ 
ton cross-feed yields, determined from simulation, 
and ifcp) i® ^he efficiency ratio for the lep¬ 
ton normalization (lepton cross-feed) and tau sig¬ 
nal (tau cross-feed) components. The values of the 
effective efficiency ratios are = 3.11 ± 0.13 

and = 3.63 ± 0.09. 


lepton normalization: This originates from B —>■ 
Di*)t Vi decays and has both visible (i.e., non¬ 
neutrino) daughters of the B meson correctly re¬ 
constructed with a distinctive distribution 

that peaks around zero. Its yield is a free parame¬ 
ter of the fit. 

lepton cross-feed: This arises from the misclassifica- 
tion of a H —?► D*£~Vi decay into the D£~ sample 
with same H-meson charge due to the loss of a low- 
energetic 7r° or 7 daughter of the D*. The broad 
^miss distribution peaks at positive values up to 
roughly 1.0 GeV^/c^. Its yield is allowed to float in 
the fit. 


tau cross-feed: This component is the analogue to 
the lepton cross-feed but originating from B —>• 
D*t~Dt decays. Its yield and shape in are 

quite similar to those of the tau signal component. 
It appears only in the Di~ samples and its yield 
is constrained by the r signal yield Tr^ignai in the 
respective D*i~ samples of same charge, assuming 
a 7r° or 7 from the D* decay is not reconstructed. 
The constraining factor is taken from the appro¬ 
priate lepton normalization and lepton cross-feed 
yields and is calibrated by a factor g that represents 
the cross-feed ratio for light-lepton and r modes; 
MC gives 5 + = 0.83 ± 0.08 for the D^l~ sample 
and 5 ° = 0.69 ±0.04 for the D^l~ sample. The tau 
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cross-feed yield is given by 

1 

YD^’°e~ ^ ^ecF _^ 

-'rCF ''rsignal yD* + .o^- „+,0 ' I”! 

wrong-charge lepton cross-feed: This component is 
similar to lepton cross-feed but arises from the loss 
of the charged pion in £>*+ —>■ D°7r+. To preserve 
the overall neutral charge of the event, the lost pion 
is absorbed into the now-misreconstructed i^tag me¬ 
son. (Since D*'^ mesons do not decay to charged 
pions, this component appears only in the D^£~ 
sample.) Its distribution resembles that of 

lepton cross-feed. Its smaller yield is constrained in 
the fit relative to the lepton normalization yield in 
the D*~^ sample with a factor = 0.107 ± 0.004, 
taken from simulation. 

fake !)(*): This component is dominated by random 
combinations of final-state particles that form a 
fake D or D* meson. This can happen by either 
missing particles in the event or misassigning par¬ 
ticles to the wrong B meson. This background oc¬ 
curs in all samples and, in the D*i~ samples, in¬ 
cludes combinations of a correctly reconstructed D 
meson and an incorrect D* primary daughter. The 
distribution is very broad and extends to the 
highest values. 

The fake D^*'> yield is estimated separately for each 
D^*'^ decay mode. A sideband region is defined in 
the distribution of the invariant mass Md (the H*- 
D mass difference AMd-^d) by excluding twice the 
signal-region width on both sides of the nominal 
mass (mass difference) and a ±60 MeV/c^ win¬ 
dow around the D* peak position for the D'^ —>■ 
AgTr+TT^ channel. Multiplying the sideband yield 
in the real data by the yield ratio in MC of the 
fake D^*') component and sideband provides an es¬ 
timate for the fake yield in each D^*'> decay 
mode; these are summed to obtain the total yield 
in each of the four data samples. 

D** background: This component contains candidates 
that originate from B —>■ D**vi{vrVT) decays. 
The higher-excitation D states decay typically to a 
D or D* meson plus one pion (although more pions 
are possible) so the final state here has a properly 
identified lepton, a properly reconstructed me¬ 
son, and (at least) one pion that might be lost or 
absorbed into i?tag- If the pion is missed, this pro¬ 
cess mimics the tau signal and exhibits a similar 
distribution. The yields of this background 
and the tau signal are comparable. In contrast to 
the other background components, it is not possible 
to constrain the yield from MC since the properties 


of i? —>■ D**£~v^ and D** decays are not known re¬ 
liably. Thus, its yield is a free parameter in the 
fit. 

fake lepton: This component contains events with a 
misidentified lepton candidate; the track is usu¬ 
ally a kaon or pion from the tag side or from 
B DK or D-k decays. This component also 
includes B D^*^T~Vr events with a misidenti¬ 
fied pion from a hadronic r decay. Since lepton 
misidentification is far less probable than D^*'> mis- 
reconstruction, a event in which both are 

misidentified is classified in the fake D^*'> compo¬ 
nent. The fake lepton background is a broad struc¬ 
ture in that appears in all four data samples; 

the fixed and relatively low yield is estimated ac¬ 
curately from MC. 

Dg decay: This component arises from the decay chain 
B —>• with Dj —>• £~vi(yTVr) and so has 

a final state that mimics the tau signal. The de¬ 
cay Dj —>• £~Di is helicity suppressed and only the 
tauonic Dj decays provide a non-negligible con¬ 
tribution. Its distribution resembles that of 

the tau signal; its low yield in MC is confirmed by 
experiment, with the most precise determination 
provided by Belle [53] ■ Consequently, this compo¬ 
nent’s yield is fixed in the fit to the MC value. 

rest: This component encompasses all background can¬ 
didates that are not captured by the other listed 
components. It contains candidates with well- 
identified final state particles that do not originate 
from one of the previously covered sources and may 
be random combinations of tag- and signal-side 
particles. Its yield is quite low in all four samples 
and is fixed in the fit to the MC value. 

Table |l] itemizes each component in the fit for each sig¬ 
nature. The yields of the fixed components are listed in 
Table m 

FIT PROCEDURE 

As explained above, the low-M^^j^g region is domi¬ 
nated by the lepton normalization and has essentially 
no sensitivity to the tau signal; in contrast, the high- 
region, where the tau signal is concentrated, ex¬ 
hibits little discrimination power in between the 

tau signal and the other backgrounds—in particular, the 
D** background. Therefore, we fit simultaneously the 
distribution below O.ShCeV^/c"^ to constrain the 
lepton normalization and lepton cross-feed yields and a 
neural-network output onb above 0.85GeV^/c'^ to con¬ 
strain the yields of the other components. (In fact, all 
components are fit in both regions.) The partition at 
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TABLE I. Fit components in each data sample. For the 
yield source, “fit” indicates a free parameter in the fit; “con¬ 
strained” reflects a dependence on other parameters; “MC” 
denotes a fixed yield taken from simulation; and “SB” iden¬ 
tifies a fixed yield derived from the corresponding sideband. 
The constraints are described in the text. 


Component 

D+£- 


D*+£- 


Yield source 

£ normalization 

/ 

/ 

/ 

/ 

Fit 

£ CF 

/ 

/ 

- 

- 

Fit 

r signal 

/ 

/ 

/ 

/ 

Fit 

r CF 

/ 

/ 

- 

- 

Constrained 

Wrong charge £ CF 

- 

/ 

- 

- 

Constrained 

Fake D 

/ 

/ 

- 

- 

Md SB 

Fake D* 

- 

- 

/ 

/ 

AMd*d SB 

D** background 

/ 

/ 

/ 

/ 

Fit 

Fake £ 

/ 

/ 

/ 

/ 

MC 

Ds decay 

/ 

/ 

/ 

/ 

MC 

Rest 

/ 

/ 

/ 

/ 

MC 


TABLE If. Yields for the fixed components in the four data 
samples. 



D+£- 

D°£- 

D*+£- 

D*0£- 

Fake 

350 

1330 

180 

2220 

Fake £ 

20.9 

69 

13.7 

12.9 

Ds decay 

22.0 

112 

21.0 

20.7 

Rest 

23.6 

77 

4.3 

4.2 

M^. = 

miss 

0.85GeVVc4 

minimizes 

the expected 

uncer- 


tainty on R{D) and R{D*). 

The aforementioned neural network is trained for each 
of the four data samples with simulated events to dis¬ 
tinguish the tau signal from the backgrounds in the 
high-M^jgg region; mainly D** background but also the 
wrong-charge cross-feed, fake lepton, Ds decay, and rest 
components. The neural network incorporates and 

several other observables that provide the desired signal- 
to-background separation. The most powerful observ¬ 
able is Eecl, the unassociated energy in the ECL that 
aggregates all clusters that are not associated with recon¬ 
structed particles (including bremsstrahlung). A nonzero 
Aecl value indicates a missing physical process in the 
event, such as a decay mode with a 7r° in which only 
a single daughter photon is reconstructed. Two addi¬ 
tional network inputs are and p|; their additional 
discriminating power is limited by their strong correla¬ 
tion with Other input variables, which provide 

marginally more discrimination, are the number of unas¬ 
signed 7r° candidates with |S'.y.y| < 5.0; the cosine of the 
angle between the momentum and vertex displacement 
of the meson; and the decay-channel identihers of 
the B and mesons. 

For use in the fit, the neural-network output Onb is 


transformed into 


Onb = iog 


Onb — Omin 
Omax — Onb 


(7) 


where the parameters Omin and Omax are the minimum 
and maximum network output values, respectively, in 
the elected data sample. The o^g distributions have 
smoother shapes and can be described well with bifur¬ 
cated Gaussian functions, which makes their parameter- 
izations more robust. 

For each fit component within a selected data sam¬ 
ple, two PDFs are determined: in for M^jgg < 

0.85GeV^/c‘^ and in o(,jg for MAgg > 0.85 GeV^/c"'. 
The PDFs of M^jgg are represented by smoothed his¬ 
tograms and are constructed by applying a smoothing 
algorithm m to the respective MG distributions. Each 
bifurcated-Gaussian PDF in o^g is parameterized by the 
mean, left width and right width, which are determined 
by an unbinned maximum likelihood fit to the MC dis¬ 
tribution. In the fit, each component has a total yield, 
defined in Table |lj with partial yields in the lower- and 
upper-M^jgg regions that are fixed MC-determined frac¬ 
tions of the total yield. 

We maximize the extended likelihood function 


^=n 


Ki 




ki^l 


( 8 ) 


where i G D°i~, D*'^£~, D*^£~} is the data- 

sample index, Q{Ni, K^) is the Poisson probability to ob¬ 
serve Ki events for an expectation value of ^ Yi j 

i 

events (with Yi^j being the yield of component j in data 
sample f), and the vector holds the values for M^jgg 
and o^g of candidate ki. The PDF Vi of data sample i 
is given by 

’7’i(A?jnissi Onb) = ^ ^ \fi,j,lo-wVi,j,\o-w{Miniss) + 

* j 

(7 ~ /Li,low)7^z,i,high(oNB)] ■ (9) 


The index j runs over the components and is 

the fraction of events of the component j that are in 
the lower M^jgg range. The one-dimensional probability 
density function T^ijjow (’Pi,j,high) represents the 
(o^g) distribution in the low- (high-)MAgg region. 

The simultaneous ht over all four data samples has 
twelve free parameters: the lepton normalization yield 
per sample, the lepton cross-feed yield per D£~ sample, 
the D** background yield per sample, and the branching- 
fraction ratios R{D) and R{D*). Here, we assume isospin 
symmetry and use the same R{D) and R{D*) parameters 
for the and B~ samples. 















CROSS-CHECKS 

The implementation of the fit procedure is tested by 
applying the same procedure to multiple subsets of the 
available simulated data. The fit accuracies are evalu¬ 
ated using sets of 500 pseudoexperiments and show no 
significant bias in any measured quantity. These are used 
also to test the influence on the fit result of the value of 
-^miss = 0-85 GeV^/c"^ that is used to partition the sam¬ 
ples: variation of this value reduces the precision of the 
fit result but does not introduce any bias. 

Further tests address the compatibility of the simu¬ 
lated and recorded data. To test resolution modelling, 
we use a sample of events with < 3.5 GeV^/c^, dom¬ 
inated by i? —>■ decays. As the D** back¬ 

ground is one of the most important components—with 
a large potential for flaws in its modeling—we evaluate 
its distributions in more depth by reconstructing a data 
sample with enriched B —> content by requir¬ 

ing a signal-like event but with an additional tt^. The 
background-enriched data samples are fit individually in 
four dimensions separately: Eecl, 

and p^, where is the missing mass of the can¬ 

didate, calculated without the additional 7r°. The shapes 
of the components are extracted from simulated data. 
In each of the four D^*H~Tr^ samples, consistent yields 
are obtained from the fits to all four variables, indicating 
that the simulation describes faithfully the distribution 
in all tested dimensions. 


RESULTS 

The fit to the entire data sample gives 

= 0.375 ± 0.064 (10) 

A(D*) = 0.293 ± 0.038 , (11) 

corresponding to a yield of 320 B —>■ and 503 

B —>■ D*t~Vt events; the errors are statistical. Projec¬ 
tions of the fit are shown in Figs. and The high- 
distributions and the fit projections are shown 
in Fig. 1^ Figures H and show the signal-enhanced 
> 2.0GeV^/(r) ht projections in Aecl (the most 
powerful classiher in the neural network) and , respec¬ 
tively. In these figures, all background components ex¬ 
cept D** background are combined into the other-BG 
component for clarity. The best-fit yields are given in 

Table mil 

From the fit, the correlation between R and R* is 
—0.56; each, in turn, is most strongly correlated with 
the U** background yields, with 0.1 to 0.2 for R and 
«0.3 for R*. 


TABLE III. Fit results and expected yields as derived from 
simulated data. 


Sample 

Component 

Yield 

Expected yield 

D+r 

£ normalization 

844 ± 34 

870 

D+r 

£ CF 

924 ± 47 

970 

D+r 

D** BG 

108 ± 38 

133 

D°£- 

£ normalization 

2303 ± 64 

2290 

D°£- 

£ CF 

7324 ± 122 

7440 

D°£- 

D** BG 

131 ±81 

210 

D*+£- 

£ normalization 

1609 ± 43 

1680 

D*+£- 

D** BG 

36± 18 

76 

D*0£- 

£ normalization 

2188 ± 60 

2280 

D*°£- 

D** BG 

117 ±39 

40 


SYSTEMATIC UNCERTAINTIES 

The dominant systematic uncertainties arise from our 
limited understanding of the D** background and from 
uncertainties in the fixed factors used in the fit. They 
are summarized in Table m and itemized below. 

TABLE IV. Overview of relative systematic uncertainties in 
percent. The last column gives the correlation between R{D) 
and R{D*). 


R{D) [%] R{D*) [%] Correlation 


shapes 

4.2 

1.5 

0.04 

D** composition 

1.3 

3.0 

-0.63 

Fake D yield 

0.5 

0.3 

0.13 

Fake £ yield 

0.5 

0.6 

-0.66 

Dg yield 

0.1 

0.1 

-0.85 

Rest yield 

0.1 

0.0 

-0.70 

Efficiency ratio 

2.5 

0.7 

-0.98 

D ^ 

Efficiency ratio / 

1.8 

0.4 

0.86 

Efficiency ratio ^ 

1.3 

2.5 

-0.99 

T~) ^ 

Efficiency ratio Rg 

0.7 

1.1 

0.94 

CF double ratio g+ 

2.2 

2.0 

-1.00 

CF double ratio 

1.7 

1.0 

-1.00 

Efficiency ratio /wc 

0.0 

0.0 

0.84 

Afmias shape 

0.6 

1.0 

0.00 

Onb shape 

3.2 

0.8 

0.00 

Lepton PID efficiency 

0.5 

0.5 

1.00 

Total 

7.1 

5.2 

-0.32 


In the table, £v shapes” refers to uncertainties 

in the parameters that are used for the shape reweight¬ 
ing of semileptonic decays. The effect on the result is 
extracted by creating different sets of weights according 
to shape hypotheses from varying individual production 
parameters within their Icr limits. 

The D** background has a strong influence on the 
extracted yield of the tau signal because the two com¬ 
ponents overlap in the Af^jgg spectrum. In addition to 
the shape uncertainties, there are uncertainties related 
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m: 


.(GeV^/c'*) 




Left: distribution for M^jgg < 0.85 GeV^/c'*; right: Onb distribution for M^jgg > 0.85 GeV^/c'*. 


to the poorly determined branching fractions to the dif¬ 
ferent D** states. The fit is therefore repeated several 
times: twice for each D** state, with its branching frac¬ 
tions varied within its uncertainties. We use the follow¬ 
ing uncertainties: 42.3% for 34.6% for Z?q, 14.9% 
for Di, 36.2% for and 100.0% for the radially ex¬ 
cited D{2S) and D*{2S). The best-fit variations in R 
are used as systematic uncertainties. They are combined 
quadratically and quoted in Table [rv| as “£)** composi¬ 
tion.” 

All fixed factors used in the fit are varied by their un¬ 
certainty (arising from the MC sample size). The influ¬ 


ence of the uncertainty of these factors is shown indi¬ 
vidually in Table IV Most factors—especially the fixed 
yields—have little influence on the overall uncertainty; 
the efficiency ratios and and the cross¬ 

feed probability ratios give the largest contributions, 
comparable to the D** composition and shape 

uncertainties. 


To evaluate the effect of PDF uncertainties, the shapes 
of all components are modified and the fit is repeated. 
The nominal fit uses smoothed-histogram PDFs in 
here, these are replaced by unsmoothed-histogram PDFs. 
The variation of the best-fit R is taken as the symmetric 
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FIG. 2. Fit projections and data points with statistical uncertainties in the D*^£ (top) and _D*°f (bottom) data samples. 
Left: distribution for < 0.85 GeV^/c'*; right: Onb distribution for M^jgg > 0.85 GeV^/c'*. 


systematic uncertainty for shape” in Table IV 


For the 0(^3 alternate model, we replace the bifurcated 
Gaussians by kernel-estimator functions with adaptive 
bandwidth. Again, the deviation from the nominal fit 
value is taken as the symmetric systematic uncertainty 
for “ 0^3 shape” in Table IV It is among the dominant 
systematic uncertainties. 


The identification efficiencies for primary and sec¬ 
ondary leptons are slightly different between simulated 
and real data. This difference affects the measurement 
by modifying the efficiency ratios. It has been calibrated 
for different lepton kinematics and run conditions using 


J/V' —t decays, leading to a 0.5% relative uncer¬ 

tainty in R{D) and R{D*). 


The correlations of R{D) and R{D*) for each item¬ 
ized systematic-uncertainty contribution are given in the 
last column of Table W These are calculated using 500 
pseudoexperiments, with two exceptions: the shape un¬ 
certainties are assumed to be uncorrelated while the lep¬ 
ton ID efficiencies are assumed to be 100% correlated 
between R{D) and R{D*). The total correlation of the 
systematic uncertainties is —0.32. 
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FIG. 3. Projections of the fit results and data points with statistical uncertainties for the high region. Top left: ; 

top right: bottom left: D^i~; bottom right: D*^£~. 


RESULTS AND DISCUSSION 

The best-fit results, including systematic uncertainties, 
are 

i?(£>) = 0.375 ± 0.064 ± 0.026 (12) 

= 0.293 ±0.038 ±0.015 . (13) 

Figure]^ shows the exclusion level in the R{D)-R{D*) 
plane, based on the likelihood distribution that is con¬ 
voluted with a correlated two-dimensional normal distri¬ 
bution according to the systematic uncertainties. The 
exclusions of the central values of the BaBar mea¬ 


surement m and the SM prediction as determined in 
Ref. m are comparably low at lAcr and 1.8 ct, respec¬ 
tively. While our measurement does not favor one over 
the other, both measurements deviate in the same direc¬ 
tion from the SM expectation. 


We also use our ht procedure to test the compatibility 
of the data samples with the two-Higgs-doublet model of 
type II. For this purpose, we perform the analysis with 
the 2HDM MC sample with tan/3/TO//+ = 0.5c^/GeV 
to extract probability density distributions. The best-fit 
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EECL{GeV) EECL{GeV) 


FIG. 4. Projections of the fit results and data points with statistical uncertainties in a signal-enhanced region of > 

2.0GeV^/c"^ in the Secl dimension. Top left: D^l~\ top right: bottom left: D^£~; bottom right: 


values in this alternate model are 

R{D) = 0.329 ± 0.060(stat.) ± 0.022(syst.) (14) 

R{D*) = 0.301 ± 0.039(stat.) ± 0.015(syst.) . (15) 

The effect on the measured R{D*) value is very small 
but the measured value for R{D) is significantly lower. 
For the prediction in the 2HDM of type II, we use for¬ 
mula (20) in Ref. |11) : the expected values are 

i?(£>)2HDM = 0.590 ±0.125 (16) 

R{D* )2HDM — 0.241 ± 0.007 . (17) 

Figure]^ shows the predictions of R{D) and R{D*) as a 


function of tan/3/m//+ for the type II 2HDM, together 
with our results for the two studied values of 0 (SM) 
and 0.5c^/GeV. In contrast to BaBar’s measurements, 
our results are compatible with the type II 2DHM in the 
ta,nl3/mH+ regions around 0.45c^/GeV and zero. 

The observable most sensitive to NP extensions of the 
SM with a scalar charged Higgs is We estimate the 
signal distributions by subtracting the background, us¬ 
ing the distributions from simulated data and the yields 
from the fit procedure, and correcting the distributions 
using efficiency estimations from simulated data. The 
D'^£~ and samples and the D*^i~ and D*^t~ 
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FIG. 5. Projections of the fit results and data points with statistical uncertainties in a signal-enhanced region of > 

2.0GeV^/c"^ in the dimension. Top left: D^l~\ top right: bottom left: D^£~; bottom right: D*°£~. 


samples are combined to increase the available statistics, 
then the full procedure is repeated using the assump¬ 
tions for the T signal in a type II 2HDM model with 
t&nl3/mH+ = 0.5c^/GeV. Figure]^ shows the measured 
background-subtracted and efficiency-corrected distri¬ 
butions for the SM and the NP point. As the signal yields 
are not extracted from fits to individual q'^ bins, the data 
distribution depends slightly on the signal model; the sig¬ 
nal model can affect the background yields in the fit to 
uncorrected data, which are then subtracted. A test 
shows that both hypotheses are compatible with our data 
with p-values for the SM distribution of 64% {DT~Dr) 


and 11% {D*t i^r), and for the NP distribution of 53% 
{Dt~Vt) and 49% {D*T~Vr). 

CONCLUSION 

We present a measurement of the relative branching 
ratios of B ^ D^*'>T~i?r to B ^ using 

the full T (45') data recorded with the Belle detector. The 
results are 

R{D) = 0.375 ± 0.064(stat.) ± 0.026(syst.) 

R{D*) = 0.293 ± 0.038(stat.) ± 0.015(syst.) . 
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FIG. 6. Exclusion level of R{D)-R{D*) value assumptions in 
standard deviations, systematic uncertainties included. 



FIG. 7. Theoretical predictions with 1 a error ranges for R{D) 
(red) and R{D*) (blue) for different values of tan (3jmjj+ 
in the 2HDM of type II. The fit results for tan/3/mj:f+ = 
0.5c^/GeV and SM are shown with their 1 cr ranges as red 
and blue bars with arbitrary width for better visibility. 


In comparison to our previous preliminary results [3, 
which are superseded by this measurement, we utilize 
a more sophisticated fit strategy with an improved han¬ 
dling of the background from B —)■ D**£~Dg events, im¬ 
pose an isospin constraint, and exploit a much higher 
tagging efficiency. By these methods, we reduce the sta¬ 
tistical uncertainties by about a third and the systematic 
uncertainties by more than a half. 

Our result lies between the SM expectation and the 
most recent measurement from the BaBar collabora¬ 


tion m and is compatible with both. It is also com¬ 
patible with a 2HDM of type II in the region around 
ta,nl3/mH+ = 0.5c^/GeV, as illustrated in Figs. [^and|^ 
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FIG. 8. Background-subtracted distributions of the r signal in the region of > 0.85 GeV^/c'^. The distributions are 

efficiency corrected and normalized to the htted yield. The error bars show the statistical uncertainties. The histogram is the 
respective expected distribution from signal MG. Left: Standard Model result, right: Type-II 2HDM result with taxi j3 /m^+ = 
0.5c^/GeV, top: B —>■ Dt~Vt, bottom: B —> D*T~Vr 


[1] Throughout this paper, the inclusion of the charge- 
conjugate decay mode is implied. 

[2] K. A. Olive et al. (Particle Data Group Collaboration), 
Chin. Phys. C 38, 090001 (2014). 

[3] J. Chay, H. Georgi and B. Grinstein, Phys. Lett. B 247, 
399 (1990); 1. 1. Y. Bigi, M. A. Shifman, N. G. Uraltsev 
and A. 1. Vainshtein, Phys. Rev. Lett. 71, 496 (1993); 
C. W. Bauer, Z. Ligeti, M. Luke, A. V. Manohar and 
M. Trott, Phys. Rev. D 70, 094017 (2004). 

[4] Y. Amhis et al. (Heavy Flavor Averaging Group (HFAG) 
Collaboration), arXiv: 1412.7515 [hep-ex]. 

[5] M. Tanaka, Z. Phys. C 67, 321 (1995). 

[6] H. Itoh, S. Komine and Y. Okada, Prog. Theor. Phys. 
114, 179 (2005); U. Nierste, S. Trine and S. West- 
hoff, Phys. Rev. D 78, 015006 (2008); M. Tanaka and 
R. Watanabe, Phys. Rev. D 82, 034027 (2010); S. Fajfer, 
J. F. Kamenik and 1. Nisandzic, Phys. Rev. D 85, 094025 
( 2012 ). 

[7] A. Matyia et al. (Belle Collaboration), Phys. Rev. Lett. 

99, 191807 (2007). 

[8] B. Aubert et al. (BaBar Collaboration), Phys. Rev. Lett. 

100, 021801 (2008). 

[9] 1. Adachi et al. (Belle Collaboration), arXiv:0910.4301 
[hep-ex]. 

[10] A. Bozek et al. (Belle Collaboration), Phys. Rev. D 82, 
072005 (2010). 


[11] J. P. Lees et al. (BaBar Collaboration), Phys. Rev. Lett. 
109, 101802 (2012); J. P. Lees et al. (BaBar Collabora¬ 
tion), Phys. Rev. D 88, 072012 (2013). 

[12] R. Aaij et al. (LHCb Collaboration), Phys. Rev. Lett. 
115, 111803 (2015). 

[13] S. Fajfer, J. F. Kamenik and 1. Nisandzic, Phys. Rev. D 
85, 094025 (2012); J. F. Kamenik and F. Mescia, Phys. 
Rev. D 78, 014003 (2008). 

[14] J. A. Bailey et al. (Fermilab Lattice and MILC Collabo¬ 
rations), Phys. Rev. D 92, 034506 (2015). 

[15] H. Na, C. M. Bouchard, G. P. Lepage, C. Monahan and 
J. Shigemitsu, Phys. Rev. D 92, 054510 (2015). 

[16] S. Kurokawa and E. Kikutani, Nucl. Instrum. Methods 
Phys. Res. Sect. A 499, 1 (2003), and other papers in¬ 
cluded in this volume; T.Abe et al., Prog. Theor. Exp. 
Phys. 2013, 03A001 (2013) and references therein. 

[17] A. Abashian et al. (Belle Collaboration), Nucl. Instrum. 
Methods Phys. Res. Sect. A 479, 117 (2002); also see 
detector section in J.Brodzicka et al., Prog. Theor. Exp. 
Phys. 2012, 04D001 (2012). 

[18] Z. Natkaniec et al. (Belle SVD2 Group), Nucl. Instr. and 
Meth. A 560, 1 (2006). 

[19] M. Feindt et al., Nucl. Instr. and. Meth. A 654, 432 

( 2011 ). 

[20] S. H. Lee et al. (Belle Collaboration), Phys. Rev. Lett. 
91, 261801 (2003). 

[21] K. Sumisawa et al. (Belle Collaboration), Phys. Rev. 
Lett. 95, 061801 (2005). 























































16 


[22] D. J. Lange, Nucl. Instr. and. Meth. A 462, 152 (2001). 

[23] R. Brun et al, Report No. CERN DD/EE/84-1 (1987). 

[24] M. Tanaka and R. Watanabe, Phys. Rev. D 87, 034028 
(2013). 

[25] A. Sibidanov et al. (Belle Collaboration), Phys. Rev. D 
88, 032005 (2013). 

[26] D. Scora and N. Isgur, Phys. Rev. D 52 (1995) 2783. 


[27] A. K. Leibovich, Z. Ligeti, 1. W. Stewart and M. B. Wise, 
Phys. Rev. D 57, 308 (1998). 

[28] 1. Caprini et al, Nucl. Phys. B530, 153 (1998). 

[29] A. Zupanc et al. (Belle Collaboration), J. High Energy 
Phys. 09, 139 (2013). 

[30] V. Blobel, histogram smoothing code, 
http: / / WWW. desy. de/ ' blobel / splft. f. 



